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1 Discovery of multi-le ve l rules and exceptions from a distributed database 
Ronan Pairceir, Sally McClean, Bryan Scotney 



August 2000 Proceedings of the sixth ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Publisher: ACM Press 

Full text available: 1|?)pdf( 1 32.32 KB ) Additional Information: full cit at ion, referen ces, index terms 



Keywords: aggregates, distributed databases, exception discovery, multi-level statistical 
models, rule discovery, sufficient statistics 
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Research session: new ap plications: Hubble: an advanced dynamic folder tec hnology | 
for XML 

Ning Li, Joshua Hui, Hui-I Hsiao, Kevin S. Beyer 

August 2005 Proceedings of the 31st international conference on Very large data 
bases VLDB '05 

Publisher: VLDB Endowment 

Full text available: , g|pdf( 422.35 KB ) Additional Information: full citation , abstract , references , index ter ms 

A significant amount of information is stored in computer systems today, but people are 
struggling to manage their documents such that the information is easily found. XML is a 
de-facto standard for content publishing and data exchange. The proliferation of XML 
documents has created new challenges and opportunities for managing document 
collections. Existing technologies for automatically organizing document collections are 
either imprecise or based on only simple criteria. Since XML documents a ... 

Brave new to pics 1 : multimedia challenges for planetary scale applications: IrisNet: 
a n internet-scale .a rchitect ure f or multimedia sensors 

Jason Campbell, Phillip B. Gibbons, Suman Nath, Padmanabhan Pillai, Srinivasan Seshan, 
Rahul Sukthankar 

November 2005 Proceedings of the 13th annual ACM international conference on 
Multimedia MULTIMEDIA v 05 

Publisher: ACM Press 

Full text available: 'QpdfQ^ MB) Additional Information: full citation , abstract , references , index terms 

Most current sensor network research explores the use of extremely simple sensors on 
small devices called motes and focuses on over-coming the resource constraints of these 
devices. In contrast, our research explores the challenges of multimedia sensors and is 
motivated by the fact that multimedia devices, such as cameras, are rapidly becoming 
inexpensive, yet their use in a sensor network presents a number of unique challenges. 



For example, the data rates involved with multimedia sensors are or ... 
Keywords: multimedia sensors, sensor networks, wide-area sensing infrastructure 



4 Concise descriptions of subsets of str uctur ed sets 
Alberto O. Mendelzon, Ken Q. Pu 

June 2003 Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART 
symposium on Principles of database systems 

Publisher: ACM Press 

Full text available: ^) pdf(334 ? 1 2 KB) Additional Information: fulicjtatLon, abstract, references, index terms 

We study the problem of economical representation of subsets of structured sets, that is, 
sets equipped with a set cover. Given a structured set U, and a language L whose 
expressions define subsets of U, the problem of Minimum Description Length in L (L-MDL) 
is: "given a subset V of U t find a shortest string in L that defines t/".We show that the 
simple set cover is enough to model a number of realistic database structures. We focus 
o ... 

5 INEX r e ports: The Wiki pedia XML corpus 
vgtv Ludovic Denoyer, Patrick Gallinari 

V June 2006 ACM SIGIR Forum, volume 40 issue l 

Publisher: AGM Press 

Full text available:^ pdf( 446, 12 KB) Additional Information: full citation, abstract, index terms 

Wikipedia is a well known free content, multilingual encyclopedia written collaboratively 
by contributors around the world. Anybody can edit an article using a wiki markup 
language that offers a simplified alternative to HTML This encyclopedia is composed of 
millions of articles in different languages. 

6 Research articles and surveys: A nal y tical proce ssin g of XML d ocum ents: 
opportunities and challenges 
Rajesh R. Bordawekar, Christian A. Lang 
June 2005 ACM SIGMOD Record, volume 34 issue 2 

Publisher: ACM Press 

Full text available: ^|pdf ( 19 1 .4 2 KB ) Additional Information: full citation, abstract, ref ere nces, index terms 

Online Analytical Processing (OLAP) has been a valuable tool for analyzing trends in 
business information. While the multi-dimensional cube model used by OLAP is ideal for 
analyzing structured business data, it is not suitable for representing and analyzing 
complex semi-structured data, such as, XML documents. Need for analyzing XML 
documents is gaining urgency as XML has become the language of choice for data 
representation across a wide range of application domains. This paper describes a prop ... 



7 Web-enabled simulation technologies: web-enabled tools, languages, and 
applications; XML-based supply chain simulation modeling 
Dean C. Chatfield, Terry P. Harrison, Jack C. Hayya 

December 2004 Proceedings of the 36th conference on Winter simulation WSC '04 
Publisher: Winter Simulation Conference 

Full text available: ^.B.df(263,31J<B) Additional Information: fulLcitatipn, abstract, references 

We describe a different approach to using XML to support the simulation modeling of 
supply chains. Instead of using XML to specify the simulation constructs, as most previous 
approaches do, we utilize XML to describe the supply chain itself. The Supply Chain 
Modeling Language (SCML) is a general, reusable, platform and methodology independent 
standard for describing a supply chain's structure and logic. SCML is usable by analysts 
using many methodologies, including simulation. We describe a samp ... 




Research sessions: new styles of XML: Ap pro ximate XML query answers 
Neoklis Polyzotis, Minos Garofalakis, Yannis Ioannidis 

June 2004 Proceedings of the 2004 ACM SIGMOD international conference on 
Management of data 

Publisher: ACM Press 

Full text available: ^ pdf( 260.73 KB ) Additional Information: full citation , abstract , references 

The rapid adoption of XML as the standard for data representation and exchange 
foreshadows a massive increase in the amounts of XML data collected, maintained, and 
queried over the Internet or in large corporate data-stores. Inevitably, this will result in 
the development of on-line decision support systems, where users and analysts 
interactively explore large XML data sets through a declarative query interface (e.g., 
XQuery or XSLT). Given the importance of remaining interactive, such on-line s ... 

Poster pa pers : X ML retrieval: what about using contextual relevance? 
Karen Sauvagnat, Lobna Hlaoua, Mohand Boughanem 

April 2006 Proceedings of the 2006 ACM symposium on Applied computing SAC '06 

Publisher: ACM Press 

Full text available: ^.pdf(307.98 KB) Additional Information: fuN citation, abstract, references 

The aim of this study is to evaluate the impact of context to better identify relevant 
elements in XML retrieval. Context is represented here by clues on whole document 
relevance. We represent context according to different points of view: by introducing 
document dimension while computing terms weights, by using document relevance when 
evaluating elements relevance or by ranking elements on document relevance. 
Experiments were undertaken on INEX collection, and results showed the interest of 
con ... 

Keywords: XML retrieval, contextual relevance, relevance propagation 



Knowledge engineering tools and techniques: Modeling o nt ologies for robotic 
environments 

Antonio Chella, Massimo Cossentino, Roberto Pirrone, Andrea Ruisi 

July 2002 Proceedings of the 14th international conference on Software engineering 
and knowledge engineering SEKE '02 

Publisher: ACM Press 

Full text available: pdf(2Q3. 7 3 KB ) Additional Information: ful l citatio n, abstract, references 

On the basis of a multiple abstraction levels specification process, we developed a 
representational model for environmental robotic knowledge through the definition of a 
set of ontologies using a multi perspective approach. A general ontological model for 
typical indoor environments has been first developed, followed by its specialization using 
an implementation perspective. Actual software implementation of the ontology has been 
obtained via a XML-based markup language, used to build a reposit ... 

Keywords: multi agent systems, ontologies, robotics 



Web-based simulation: SISCO: a supply chain simulation tool utilizin g silk™ and XML 
Dean C Chatfield, Terry P. Harrison, Jack C. Hayya 

December 2001 Proceedings of the 33nd conference on Winter simulation 
Publisher: IEEE Computer Society 

Full text available: « pdf( 276.14 KB ) Additional Information: full citation, abstract, references, cr^gs, index 
^ terms 

We discuss SISCO, the Simulator for Integrated Supply Chain Operations, a Java-based 
tool that simplifies supply chain simulation model development. SISCO maps supply chain 
descriptions stored in the XML-based Supply Chain Modeling Language (SCML) format to a 
set of supply chain "building blocks" developed with ThreadTec's Silk™ simulation classes. 



The resulting system combines the ease of a visual supply chain simulator, the power and 
flexibility of a full object-oriented programming lang ... 



1 2 A po we rf u I a nd SQ L-cp mpati bl e data, mod el a nd _q u e ryja n g u i a ge for OLA P 
Dennis Pedersen, Karsten Riis, Torben Bach Pedersen 

January 2002 Australian Computer Science Communications , Proceedings of the 

thirteenth Australasian database conference - Volume 5 ADC '02, volume 

24 Issue 2 

Publisher: Australian Computer Society, Inc., IEEE Computer Society Press 

Full text available: « pd« 1 .1 2 MB) Additlonal Information: full citation , abstract, references, citings, index 
^ terms 

In this paper we present the SQLA7 OLAP data model, formal algebra, and query language 
that, unlike current OLAP data models and languages, are both powerful, meaning that 
they support irregular dimension hierarchies, automatic aggregation of data, and correct 
aggregation of data, and SQL-compatible, allowing seamless integration with relational 
technology. We also consider the requirements to the data model posed by integration of 
OLAP data with external XML data. ... 

Keywords: OLAP, data integration, data models, multidimensional databases, query 
languages 



1 3 Research sessions: new styles of XML: Colorful XML: o ne hie rarchy isn't enough 
H. V. Jagadish, Laks V. S. Lakshmanan, Monica Scannapieco, Divesh Srivastava, Nuwee 

^ Wiwatwattana 

June 2004 Proceedings of the 2004 ACM SIGMOD international conference on 
Management of data 

Publisher: ACM Press 

Full text available: *g| pdf(339.49 KB) Additional Information: full citation , abstract , references 

XML has a tree-structured data model, which is used to uniformly represent structured as 
well as semi-structured data, and also enable concise query specification in XQuery, via 
the use of its XPath (twig) patterns. This in turn can leverage the recently developed 
technology of structural join algorithms to evaluate the query efficiently. In this paper, we 
identify a fundamental tension in XML data modeling: (i) data represented as deep trees 
(which can make effective use of twig patterns) are o ... 

1 4 Extending TPC-W to allow for f i ne gra ined workl oad sp ec ifi ca tion 

# Christian Kurz, Carlo's Guerrero, Gunter Haring 
July 2005 Proceedings of the 5th international workshop on Software and 

performance WOSP '05 
Publisher: ACM Press 

Full text available: ^|.pdf(160.82 KB) Additional Information: fuii citation, abstract, references 

This paper presents a method to characterize workload from a web server logfile from a 
user perspective. The data obtained in this process is used to create workload for the TPC- 
W benchmark. 

Keywords: benchmark, capacity planning, user modeling, web performance 



15 An XML query engine for network-bou nd da ta 
Zachary G. Ives, A. Y. Halevy, D. S. Weld 

December 2002 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume ll Issue 4 
Publisher: Springer-Verlag New York, Inc. 

Full text available: 'jg pdf( 35 1 .86 KB) Additional Information: full citation , abstrac t, citings, index terms 

XML has become the lingua franca for data exchange and integration across 
administrative and enterprise boundaries. Nearly all data providers are adding XML import 



or export capabilities, and standard XML Schemas and DTDs are being promoted for all 
types of data sharing. The ubiquity of XML has removed one of the major obstacles to 
integrating data from widely disparate sources - namely, the heterogeneity of data 
formats. However, general-purpose integration of data across the wide are a also re ... 

Keywords: Data integration, Data streams, Query processing, Web and databases, XML 



1 6 Web services and performance evaluation: A fine-grain ed replacement strate gy for 
^ XML query cache 

^ Li Chen, Song Wang, Elizabeth Cash, Burke Ryder, Ian Hobbs, Elke A. Rundensteiner 

November 2002 Proceedings of the 4th international workshop on Web information 

and data management 
Publisher: ACM Press 

Full text available: « pdf(225.22 KB) Additional ,nformation: M citation, abstract, references, citings, index 
^ term s 

Caching popular queries and reusing results of previously computed queries is one 
important query optimization technique, especially in modern distributed environments 
such as the WWW. Based on the recent proliferation of XML data and the emergence of 
the XQuery language, we are thus developing a query- based caching system for XQuery 
queries, called ACE-XQ. ACE-XQ applies innovative query containment and rewriting 
strategies to answer incoming user queries based on the cached XQueries, whenever ... 

Keywords: XML, XQuery, cache replacement strategy, query containment, query 
rewriting, semantic caching 



17 Moving up the foo d chain: su pporting e-commerce applications on databases 
^ Anant Jhingran 

December 2000 ACM SIGMOD Record, Volume 29 issue 4 

Publisher: ACM Press 

Full text available: ^.pdf(447.48 KB) Additional Information: full citation, abstra ct, citings, index terms 

Database systems have enjoyed a tremendous market because they have served many 
applications really well — transaction processing in the beginning, and then decision 
support. Today, with over 200% cumulative growth rate in certain segments of E- 
Commerce, it is clear that this new class of applications will be a strong driver for 
databases to grow, commercially, as well as from a Research perspective. This paper 
outlines some of the issues that I have learnt in dealing with E-Commerce applicatio ... 

1 8 Fre e-link topolo g y navi g ation on statistical table ob j ects: metadata schema and user 
i nterfa ce 

Xiangming Mu, Gary Marchionini 

May 2003 Proceedings of the 2003 annual national conference on Digital 
government research dg.o '03 

Publisher: Digital Government Research Center 

Full text available: ^pdf(45,90 KB) Additional Information: full citation, abstract, references 

A table object, defined and described in XML, is treated as the basic unit in a free-link 
topology that enables multiple inheritances for each node. In this paper, a novel table 
navigation system called TableHunter is introduced. Each table object is presented in the 
TableHunter as a node of a map structure, and supports a context+focus view. 



19 Scaling up the semantic web: On labeling sche mes for the semantic web 

A Vassilis Christophides, Dimitris Plexousakis, Michel Scholl, Sotirios Tourtounis 

>^ May 2003 Proceedings of the 12th international conference on World Wide Web 

Publisher: ACM Press 

Full text available: l g) pdf( 294.32 KB ) Additional Information: full citation , abstract , references , citings, index 



terms 



This paper focuses on the optimization of the navigation through voluminous subsumption 
hierarchies of topics employed by Portal Catalogs like Netscape Open Directory (ODP). We 
advocate for the use of labeling schemes for modeling these hierarchies in order to 
efficiently answer queries such as subsumption check, descendants, ancestors or nearest 
common ancestor, which usually require costly transitive closure computations. We first 
give a qualitative comparison of three main families of schemes ... 

20 fast algorithm for creating space efficient dispatching tables with ap plicati on to multi- Q 
^ dispatching 

^ Yoav Zibin, Joseph Yossi Gil » 
November 2002 ACM SIGPLAN Notices , Proceedings of the 17th ACM SIGPLAN 

conference on Object-oriented programming, systems, languages, 

and applications OOPSLA '02, volume 37 issue n 
Publisher: ACM Press 

Full text available:^ pdf(31 2. 23 KB ) Additional Information: full citation , abstract , refer ences , citings 

The dispatching problem can be solved very efficiently in the single-inheritance~(SI) 
setting. In this paper we show how to extend one such solution to the multiple- 
inheritance~(MI) setting. This generalization comes with an increase to the space 
requirement by a small factor of k This factor can be thought of as a metric of the 
complexity of the topology of the inheritance hierarchy. On a data set of~35 hierarchies 
totaling some~64 thousand types, our dispatching data structure, based on a ... 
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ORDPATHs: insert-friendly XML node labels - group of 8 _» 

PO'Neil, EO'Neil, S Pal, I Cseri, G Schaller, N ... - Proceedings of the 2004 ACM SIGMOD international conference .... 2004 - 
portal.acm.org 

... locate nodes on all XPATH axes of hierarchy and precedence ... fan-out is very common 
with the XML trees of ... possible to base the scheme on statistics of trees for a ... 

Cited by 60 - Related. Articles - Web Seajxh 

The ICS-FORTH RDFSuite: Managing Voluminous RDF Description Bases - group of 23 » 

S Alexaki, V Christophides, G Karvounarakis, D ... - 2nd International Workshop on the Semantic Web, 2001 - 139.91.183.30 

... Table 1 : ODP hierarchy statistics. ... Last but not least, semistructured or XML models 

can't distinguish between entity (eg, ExtResource) and property labels (eg ... 

Cited by 1 1 9 - Related Articles - Cached - Web Search 

A fine-grained replacement strategy for XML query cache - group of 5 » 

L Chen, S Wang, E Cash, B Ryder, I Hobbs, EA . . . - Proceedings of the fourth international workshop on Web . . . ; 2002 - 
portal.acm.org 

... contains or partially overlaps a new query, the utility statistics of those ... with 
the extra expressiveness of pattern matching based on XML hierarchy and result ... 

Cited by 7 - Related Articles - Web Search 

Xaggregation: Flexible aggregat ion of XML data 

H WANG, J LI, Z HE, H GAO - Lecture notes in computer science - cat.inist.fr 

... With Xaggregation, statistics of XML data becomes more flexible with function of 

aggregating heterogeneous data and hierarchy data along some path of XMI 

Cited by 2 - Related Articles - Web Search - BL Direct 

Querying XML data sources in DB2: the XML Wrapper - grou p of 2 » • 

V Josifovski, P Schwarz - ieeexplore.ieee.org 

... to choose a good join order are derived from statistics about table ... the nickname 
that corresponds to the © customer elements of the XML hierarchy of Figure 1 ... 

Cited by 2 - Related Articles - Web.Search - BLJDjrect 

Indexing and querying XML data for regular path expressions - group of 43 » 
Q Li, B Moon - Proceedings of the 27th International Conference on Very 2001 - gdit.iiit.net 
... This numbering scheme quickly determines the ancestor-descendant relationship 
between ele- ments in the hierarchy of XML data. We ... 

Cited by 415 - Related Articles - View as HTML - Web Search - BL Direct 

XML and Object-Relational Database Systems - group of 1 7 » 
M Klettke, H Meyer - Proceedings of the 3 rdACM International Conference on the ... - Springer 
... The statistics are derived from sample XML document sets and some knowledge ... XML 
documents based on DTD s. The element hierarchy and attributes are represented ... 

Cited by 87 - Related Articles - Web Search - BL Direct 
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L Denoyer, P Gallinari - ACM SIGIR Forum, 2006 - portal.acm.org 

... The documents of the wikipedia XML collections are organized in a hierarchy of ... the 

hierarchy of categories ... Table 3 gives statistics about the categories. ... 
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M Smith, A Khotanzad - Proceedings of the International Conference on Information .... 2005 - doi.ieeecomputersociety.org 

... color and texture) are then inserted into the XML hierarchy representation of the ... 

2. Statistics are extracted from each color cluster - the mean, standard ... 

Related Articles - Web Search 

Stored Procedures for Distributed XML Databases - group of 2 » 
S Chen, PB Gibbons, S Nath - intel-research.net 

... input XPATH of a stored function may be an on-demand stored query invocation; in 
other words, an on-demand stored query higher in the XML hierarchy may process ... 

Cited by 3 - Related Articles - View as HTML - Web Search 
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